Basic Statistics

Raw Counts

Name Value
Rows 2,078,580
Columns 27
Discrete columns 17
Continuous columns 10
All missing columns 0
Missing observations 89,937
Complete Rows 2,029,126
Total observations 56,121,660
Memory allocation 564.1 Mb

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (with frequency)

## 12 columns ignored with more than 50 categories.
## incident_id: 2078580 categories
## incident_location: 17315 categories
## call_description: 250 categories
## category: 223 categories
## call_code: 344 categories
## called_at: 2069106 categories
## called_at_est: 2069106 categories
## case_id: 352342 categories
## zip_code: 59 categories
## scout_car_area: 228 categories
## neighborhood_name: 206 categories
## geom: 18265 categories

QQ Plot

## Warning: Removed 18 rows containing non-finite values (`stat_qq()`).
## Warning: Removed 18 rows containing non-finite values (`stat_qq_line()`).

Correlation Analysis

## 13 features with more than 20 categories ignored!
## incident_id: 2029126 categories
## incident_location: 16887 categories
## call_description: 247 categories
## category: 220 categories
## call_group: 24 categories
## call_code: 341 categories
## called_at: 2020111 categories
## called_at_est: 2020111 categories
## case_id: 346224 categories
## zip_code: 44 categories
## scout_car_area: 228 categories
## neighborhood_name: 205 categories
## geom: 17800 categories
## Warning in cor(x = structure(list(intake_time_in_minutes = c(1.4, 1.3, 1.5, : the standard
## deviation is zero

Principal Component Analysis

## 11 features with more than 50 categories ignored!
## incident_id: 2029126 categories
## incident_location: 16887 categories
## call_description: 247 categories
## category: 220 categories
## call_code: 341 categories
## called_at: 2020111 categories
## called_at_est: 2020111 categories
## case_id: 346224 categories
## scout_car_area: 228 categories
## neighborhood_name: 205 categories
## geom: 17800 categories
## Warning in plot_prcomp(data = structure(list(incident_id = c("202406904463", : The following features are dropped due to zero variance:
##  * is_officer_initiated_No